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(54) Title: A HUMAN EDG-6 RECEPTOR HOMOLOGUE 


(57) Abstract 



An isolated nucleic acid sequence coding for an amino acid sequence for a novel human EDG-6 receptor homologue is provided. 
Also provided are purified human EDG-6 receptor polypeptides derived from the nucleic acid and methods and transgenic animals therefor. 
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A HUMAN EDG-6 RECEPTOR HOMOLOGUE 


FIELD OF THE INVENTION 

5 The present invention is in the field of molecular biology; more particularly, the 

present invention describes a nucleic acid sequence and an amino acid sequence for a novel 
human EDG-6 receptor homologue. 

BACKGROUND OF THE INVENTION 

10 

The family of edg (endothelial differentiation gene) receptors are commonly 
grouped with orphan receptors because their endogenous ligands are not known (for example 
see Hla, T. and Maciag, T. (1990) J. BioL Chem. 265:9308-13; US patent 5,585,476). 
Recently, however, lysophospatidic acid (LP A) has been demonstrated to be the endogenous 

15 ligand for the edg-2 receptor (Hecht et al. (1996) J. Cell. Biol. 135: 1071-1083; An et al. 
(1997) Biochem. Biophys. Res. Comm. 213 : 61 9-622). 

The edg family of receptors are seven transmembrane G protein coupled receptors 
(T7Gs). T7Gs are so named because of their seven hydrophobic domains which span the 
plasma membrane and form a bundle of antiparallel a helices. These transmembrane 

20 segments (TMS) are designated by roman numerals I-VII and account for structural and 
functional features of the receptor. In most cases, the bundle of helices forms a binding 
pocket; however, when the binding site must accommodate more bulky molecules, the 
extracellular N-terminal segment or one or more of the three extracellular loops participate in 
binding and in subsequent induction of conformational change in intracellular portions of the 

25 receptor. The activated receptor, in turn, interacts with an intracellular G-protein complex 
which mediates further intracellular signaling activities generally the production of second 
messengers such as cyclic AMP (cAMP), phospholipase C, inositol triphosphate or ion 
channel proteins. 

T7G receptors are expressed and activated during numerous developmental and 
30 disease processes. Identification of a novel T7G receptor provides the opportunity to 

diagnose or intervene in such processes, and the receptor can be used in screening assays to 
identify physiological or pharmaceutical molecules which trigger, prolong or inhibit its 
activity. 
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SUMMARY OF THE INVENTION 

The invention provides a unique nucleotide sequence which encodes a novel human 
5 EDG-6 receptor homologue (HEDG). Herein, the nucleotide sequence encoding HEDG is 
designated hedg. Thus, the invention provides an isolated nucleic acid molecule wherein the 
nucleic acid molecule encodes a polypeptide having an amino acid sequence as shown in 
SEQ. IDNO:2. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
1 0 having a nucleotide sequence as shown in SEQ. ID NO: 1 . 

In yet another embodiment, the invention provides a nucleic acid molecule which is 
anti-sense to the molecules indicated above. 

In a further embodiment, the invention provides for expression vectors, probes and 
DNA constructs based on the polynucleotides mentioned above. 
15 In another embodiment, the invention provides for a purified polypeptide having the 

amino acid sequence as shown in SEQ. ID NO:2. 

The invention also provides for antibodies specific to the above polypeptide. 
In another embodiment, the invention provides for methods of purifying and assaying 
polypeptides as indicated above. 
20 In a further embodiment, the invention provides for transgenic animals which include 

the nucleotide sequence of the invention. 


BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figures 1 A and IB shows the alignment of the nucleic acid sequence (coding region 

of SEQ. ID NO: 1) and amino acid sequence (SEQ. ID NO:2) for HEDG. 

Figure 2 displays the nucleic acid sequence (SEQ. ID NO:3) of a cDNA encoding 

HEDG. 


30 DETAILED DESCRIPTION OF THE INVENTION 

As used herein and designated by the upper case abbreviation, HEDG, refers to an 
EDG-6 receptor homologue in either naturally occurring or synthetic form and active 
fragments thereof which have the amino acid sequence of SEQ. ID NO:2. In one 
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embodiment, the polypeptide HEDG is encoded by mRNAs transcribed from the cDNA, as 
designated by the lower case abbreviation, hedg, of SEQ. ID NO: 1 . 

The novel human EDG-6 receptor homologue, HEDG, was cloned and isolated from 
a human kidney proximal tubule cDNA library. It shows 52.9% identity to human edg-2 
5 (WO 97/00952). 

An "oligonucleotide" is a stretch of nucleotide residues which has a sufficient 
number of bases to be used as an oligomer, amplimer or probe in a polymerase chain reaction 
(PCR). Oligonucleotides are prepared from genomic or cDNA sequence and are used to 
amplify, reveal or confirm the presence of a similar DNA or RNA in a particular cell or 

1 0 tissue. Oligonucleotides or oligomers comprise portions of a DNA sequence having at least 
about 10 nucleotides and as many as about 35 nucleotides, preferably about 25 nucleotides. 

"Probes" may be derived from naturally occurring or recombinant single - or double - 
stranded nucleic acids or be chemically synthesized. They are useful in detecting the 
presence of identical or similar sequences. 

15 A "portion" or "fragment" of a polynucleotide or nucleic acid comprises all or any 

part of the nucleotide sequence having fewer nucleotides than about 6 kb, preferably fewer 
than about 1 kb which can be used as a probe. Such probes may be labeled with reporter 
molecules using nick translation, Klenow fill-in reaction, PCR or other methods well known 
in the art. After optimizing reaction conditions to eliminate false positives, nucleic acid 

20 probes may be used in Southern, Northern or in situ hybridizations to determine whether 
DNA or RNA encoding HEDG is present in a cell type, tissue, or organ. 

"Reporter" molecules are those radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents which associate with, establish the presence of, 
and may allow quantification of a particular nucleotide or amino acid sequence. 

25 "Recombinant nucleotide variants" encoding HEDG may be synthesized by making 

use of the "redundancy" in the genetic code. Various codon substitutions, such as the silent 
changes which produce specific restriction sites or codon usage-specific mutations, may be 
introduced to optimize cloning into a plasmid or viral vector or expression in a particular 
prokaryotic or eukaryotic host system, respectively. 

30 "Chimeric" molecules may be constructed by introducing all or part of the 

nucleotide sequence of this invention into a vector containing additional nucleic acid 
sequence which might be expected to change any one (or more than one) of the following 
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SEQUENCE LISTING 


(1) GENERAL INFORMATION; 

(i) APPLICANT: MUNROE , Donald G. 

VYAS, Tejal B. 

(ii) TITLE OF INVENTION: A HUMAN EDG-6 RECEPTOR HOMOLOG 

(iii) NUMBER OF SEQUENCES : 7 

<iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Nikaido, Marmelstein, Murray & Oram LLP 

(B) STREET: 655 15th St., NW, Suite 330 - G Street Lobby 

(C) CITY: Washington 
<D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-5701 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/861,747 

(B) FILING DATE: 22-MAY-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Jahns, Kristina M. 

(B) REGISTRATION NUMBER: 41,092 

{C) REFERENCE /DOCKET . NUMBER : P8074-7003 

(XX) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 638-5000 

(B) TELEFAX: (202) 638-4810 


(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CGCTCCCGCC GCAGTCGCCG GGCCATGGGC CTCGAGCCCG CCCCGAACCC CCGCGAGCCC 
GCCTTGTCTG CGGCGTGACT GGAGGCCCAG ATGGT CATCA TGGGCCAGTG CTACTACAAC 
GAG AC CAT C G GTTTCTTCTA TAACAACAGT GGCAAAGAGC TCAGCTCCCA CTGGCGGCCC 
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AAGGATGTGG TCGTGGTGGC ACT GGGGCT G ACCGTCAGCG TGCTGGTGCT GCTGACCAAT 240 

CTGCTGGTCA TAGCAGCCAT CGCCTCCAAC CGCCGCTTCC ACCAGCCCAT CTACTACCTG 300 

CTCGGCAATC TGGCCGCGGC TGACCTCTTC GCGGGCGTGG CCTACCTCTT CCTCATGTTC 360 

CACACTGGTC CCCGCACAGC CCGACTTTCA CTTGAGGGCT GGTTCCTGCG GCAGGGCTTG 420 

CTGGACACAA GCCTCACTGC GTCGGTGGCC ACACTGCTGG CCATCGCCGT GGAACGGCAC 4B0 

CGCAGTGTGA TGGCCGTACA GTTGCACAGC CGCCTGCCCC GTGGCCGCGT GGTCATGCTC 540 

ATTGTGGGCG TGTGGGTGGC TGCCCTGGGC CTGGGGCTGT TGCCTGCCCA CTCCTGGCAC 600 

TGCCTCTGTG CCCTGGACCG CTGCTCACGC ATGGCACCCC TGCTCAGCCG CTCCTATTTG 660 

GCCGTCTGGG CTCTGTCGAG CCTGCTTGTC TTCCTGCTCA TGGTGGCTGT GTACACCCGC 720 

ATTTTTTTAT ACGTGCGGCG GCGAGTGCAG CGCATGGCAG AGCATGTCAG CTGCCACCCC 780 

CGCTACCGAG AGACCACGCT CAGCCTGGTC AAGACTGTTG TCATCATCCT GGGGGCGTTC 8 40 

GTGGTCTGCT GGACACCAGG CCAGGTGGTA CTGCTCCTGG ATGGTTTAGG CTGTGAGTCC 900 

TGCAATGTCC TGGCTGTAGA AAAGTACTTC CTACTGTTGG CCGAGGCCAA CTCACTGGTC 960 

AATGCTGCTG TGTACTCTTG CCGAGATGCT GAGATGCGCC GCACCTTCCG CCGCCTTCTC 1020 

TGCTGCGCGT GCCTCCGCCA GCCCACCCGC GAGTCTGTCC ACTATACATC CTCTGCCCAG 1080 

GGAGGTGCCA GCACTCGCAT CATGCTTCCC GAGAACGGCC ACCCACTGAT GGACTCCACC 1140 

CTTTAGCTAC CTTGAACTTC AGCGGTACGC GGCAAGCAAC AAATCCACAG CCCCTGATGA 1200 

CTTGTGGGTG CTCCTGGCTC AACCCAACCA ACAGGACTGA CTGACCGGCA GGACAAGGTC 12 60 

TGGCATGGCA CAGCACCACT GCCAGGCCTC CCCAGGCACA CCACTCTGCC CAGGGAATGG 1320 

GGGCTTTGGG TCATCTCCCA CTGCCTGGGG GAGTCAGATG GGGTGCAGGA ATCTGGCTCT 1380 

TCAGCCATCC CAGGTTTAGG GGGTTTGTAA CAGACATTAT TCTGTTTTCA CTGCGTATCC 1440 

TTGGTAAGCC CTGTGGACTG GTTCCTGCTG TGTGATGCTG AGGGTTTTAA GGTGGGGAGA 1500 

GATAAGGGCT CTCTCGGGCC ATGCTACCCG GTATGACTGG GTAATGAGGA CAGACTGTGG 1560 

ACACCCCATY TACCTGAGTC TGATTCTTTA GCAGCAGAGA CTGAGGGGTG CAGAGTGTGA 1620 

GCTGGGAAAG GTTTGTGGCT CCTTGCAGCC TCCAGGGACT GGCCTGTCCC CGATAGAATT 1680 

GAAGCAGTCC ACGGGGAGGG GATGATACAA GGAGTAAACC TTTCTTTACA CTCTGAGGTC 1740 

TCCAAAACAT TTGTTGTTAT C 1761 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

-37- 


SUBSTITUTE SHEET (RULE 26) 


WO 98/53062 PCT/CA98/00487 


(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Val lie Met Gly Gin Cys Tyr Tyr Asn Glu Thr lie Gly Phe Phe 
15 10 15 

Tyr Asn Asn Ser Gly Lys Glu Leu Ser Ser His Trp Arg Pro Lys Asp 
20 25 30 

Val Val Val Val Ala Leu Gly Leu Thr Val Ser Val Leu Val Leu Leu 
35 -40 45 

Thr Asn Leu Leu Val He Ala Ala He Ala Ser Asn Arg Arg Phe His 
50 55 60 

Gin Pro lie Tyr Tyr Leu Leu Gly Asn Leu Ala Ala Ala Asp Leu Phe 
65 70 75 80 

Ala Gly Val Ala Tyr Leu Phe Leu Met Phe His Thr Gly Pro Arg Thr 
85 90 95 

Ala Arg Leu Ser Leu Glu Gly Trp Phe Leu Arg Gin Gly Leu Leu Asp 
100 105 HO 

Thr Ser Leu Thr Ala Ser Val Ala Thr Leu Leu Ala He Ala Val Glu 
115 120 125 

Arg His Arg Ser Val Met Ala Val Gin Leu His Ser Arg Leu Pro Arg 
130 135 140 

Glv Arg Val Val Met Leu He Val Gly Val Trp Val Ala Ala Leu Gly 
145 150 155 160 

Leu Gly Leu Leu Pro Ala His Ser Trp His Cys Leu Cys Ala Leu Asp 
165 170 175 

Arg Cys Ser Arg Met Ala Pro Leu Leu Ser Arg Ser Tyr Leu Ala Val 
1B0 185 190 

Trp Ala Leu Ser Ser Leu Leu Val Phe Leu Leu Met Val Ala Val Tyr 
195 200 205 

Thr Arg He Phe Leu Tyr Val Arg Arg Arg Val Gin Arg Met Ala Glu 
210 215 220 

His Val Ser cys His Pro Arg Tyr Arg Glu Thr Thr Leu Ser Leu Val 
225 230 235 240 

Lys Thr Val Val He He Leu Gly Ala Phe Val Val Cys Trp Thr Pro 
245 250 255 

Glv Gin Val Val Leu Leu Leu Asp Gly Leu Gly Cys Glu Ser Cys Asn 
260 265 270 

Val Leu Ala Val Glu Lys Tyr Phe Leu Leu Leu Ala Glu Ala Asn Ser 
275 280 285 

Leu Val Asn Ala Ala Val Tyr Ser Cys Arg Asp Ala Glu Met Arg Arg 
290 295 300 

Thr Phe Arg Arg Leu Leu Cys Cys Ala Cys Leu Arg Gin Pro Thr Arg 
305 .310 315 320 
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Glu Ser Val His Tyr Thr Ser Ser Ala Gin Gly Gly Ala Ser Thr Arg 
325 330 335 

lie Met Leu Pro Glu Asn Gly His Pro Leu Met Asp Ser Thr Leu 


(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1889 base pairs 
(BJ TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTACGAATTA ATACGATCAC TATAGGGAGA CCAAGCTTGG TACCGAGCTC GGAT CCACTA 60 

GTAACGGCCG CCAGTGTGGG GAATTCCGCT CCCGCCGCAG TCGCCGGGCC ATGGGCCTCG 120 

AGCCCGCCCC GAACCCCCGC GAGCCCGCCT TGTCTGCGGC GTGACTGGAG GCCCAGATGG 18 0 

TCATCATGGG CCAGTGCTAC TACAACGAGA CCATCGGTTT CTTCTATAAC AACAGTGGCA 24 0 

AAGAGCTCAG CTCCCACTGG CGGCCCAAGG ATGTGGTCGT GGTGGCACTG GGGCTGACCG 300 

TCAGCGTGCT GGTGCTGCTG ACCAATCTGC TGGTCATAGC AGCCATCGCC TCCAACCGCC 360 

GCTTCCACCA GCCCATCTAC TACCTGCTCG GCAATCTGGC CGCGGCTGAC CTCTTCGCGG 420 

GCGTGGCCTA CCTCTTCCTC ATGTTCCACA CTGGTCCCCG CACAGCCCGA CTTTCACTTG 480 

AGGGCTGGTT CCTGCGGCAG GGCTTGCTGG ACACAAGCCT CACTGCGTCG GTGGCCACAC 54 0 

TGCTGGCCAT CGCCGTGGAA CGGCACCGCA GTGTGATGGC CGTACAGTTG CACAGCCGCC 600 

TGCCCCGTGG CCGCGTGGTC ATGCTCATTG TGGGCGTGTG GGTGGCTGCC CTGGGCCTGG 660 

GGCTGTTGCC TGCCCACTCC TGGCACTGCC TCTGTGCCCT GGACCGCTGC TCACGCATGG 720 

CACCCCTGCT CAGCCGCTCC TATTTGGCCG TCTGGGCTCT GTCGAGCCTG CTTGTCTTCC 780 

TGCTCATGGT GGCTGTGTAC ACCCGCATTT TTTTATACGT GCGGCGGCGA GTGCAGCGCA 840 

TGGCAGAGCA TGTCAGCTGC CACCCCCGCT ACCGAGAGAC CACGCTCAGC CTGGTCAAGA 900 

CTGTTGTCAT CATCCTGGGG GCGTTCGTGG TCTGCTGGAC ACCAGGCCAG GTGGTACTGC 960 

TCCTGGATGG TTTAGGCTGT GAGTCCTGCA ATGTCCTGGC TGTAGAAAAG TACTTCCTAC 1020 

TGTTGGCCGA GGCCAACTCA CTGGTCAATG CTGCTGTGTA CTCTTGCCGA GAT GCT GAGA 1080 

TGCGCCGCAC CTTCCGCCGC CTTCTCTGCT GCGCGTGCCT CCGCCAGCCC ACCCGCGAGT 1140 

CTGT CCACTA TACATCCTCT GCCCAGGGAG GTGCCAGCAC TCGCATCATG CTTCCCGAGA 1200 

ACGGCCACCC ACTGATGGAC TCCACCCTTT AGCTACCTTG AACTTCAGCG GTACGCGGCA 1260 

AGCAACAAAT CCACAGCCCC TGATGACTTG TGGGTGCTCC TGGCTCAACC CAACCAACAG 1320 
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GACTGACTGA CCGGCAGGAC AAGGTCTGGC ATGGCACAGC ACCACTGCCA GGCCTCCCCA 1380 

GGCACACCAC TCTGCCCAGG GAATGGGGGC TTTGGGTCAT CTCCCACTGC CTGGGGGAGT 1440 

CAGATGGGGT GCAGGAATCT GGCTCTTCAG CCATCCCAGG TTTAGGGGGT TTGTAACAGA 1500 

CATTATTCTG TTTTCACTGC GTATCCTTGG TAAGCCCTGT GGACTGGTTC CTGCTGTGTG 1560 

ATGCTGAGGG TTTTAAGGTG GGGAGAGATA AGGGCTCTCT CGGGCCATGC TACCCGGTAT 1620 

GACTGGGTAA TGAGGACAGA CTGTGGACAC CCCATYTACC TGAGTCTGAT TCTTTAGCAG 1680 

CAGAGACTGA GGGGTGCAGA GTGTGAGCTG GGAAAGGTTT GTGGCTCCTT GCAGCCTCCA 1740 

GGGACTGGCC TGTCCCCGAT AGAATTGAAG CAGTCCACGG GGAGGGGATG ATACAAGGAG 1800 

TAAACCTTTC TTTACACTCT GAGGTCTCCA AAACATTTGT TGTTATCAAA AAAAAAAAAA 1860 

AAAAAAAAAA AAAAAAAAAA AGCGGCCGC 1889 


(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGTGGTACTG CTCCTGGATG GTTTAG 


(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGGAGGCACG CGCAGCAGAG AAGA 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TAGAGAACCC ACTGCTTAC 19 


(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCCAGAATAG AATGACACC 19 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 


1 . An isolated nucleic acid molecule wherein said nucleic acid molecule encodes a 
polypeptide having an amino acid sequence as shown in SEQ. ID NO:2. 

2. The isolated nucleic acid molecule of claim 1 wherein said nucleic acid is DNA. 

3. The isolated nucleic acid of claim 2 wherein said nucleic acid is selected from the 
group consisting of: 

a) the nucleotide sequence as shown in SEQ. ID NO: 1; 

b) nucleotide sequences that hybridize to SEQ. ID NO:lor to its complementary 

strand; 

c) nucleotide sequences that differ from SEQ. ID NO:l and from the nucleotide 
sequences of (b) in codon sequence due the degeneracy of the genetic code. 

4. The isolated nucleic acid of claim 2 wherein said nucleic acid includes the nucleotide 
sequence as shown in SEQ. ID NO:l . 

5. The isolated nucleic acid of claim 1 wherein said nucleic acid is RNA. 

6. An isolated nucleic acid which is anti-sense to a nucleic acid as claimed in claim 1. 

7. An isolated nucleic acid which is anti-sense to a nucleic acid as claimed in claim 3. 

8. An isolated nucleic acid which is anti-sense to a nucleic acid as claimed in claim 4 

9. The isolated nucleic acid of claim 1 which is an RNA anti-sense sequence. 

1 0. A DNA construct comprising the following operably linked elements: 

a) a transcriptional promoter; 

b) a DNA sequence including the nucleotide sequence as shown in SEQ. ID NO:l; 

and, 

c) a transcriptional terminator. 

11. The DNA construct of claim 1 0 wherein said DNA sequence encodes the polypeptide 
ofSEQ.IDNO:2. 

12. A recombinant expression vector suitable for transformation of a host cell comprising 
a nucleic acid as claimed in claim 1 and a regulatory sequence operatively linked to said 
nucleic acid. 

13. A recombinant expression vector suitable for transformation of a host cell comprising 
a DNA molecule having a nucleotide sequence as shown in SEQ. ID NO:l and a regulatory 
sequence operatively linked to said DNA molecule. 
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14. The recombinant expression vector of claim 13 wherein the DNA molecule is 
operatively linked to the regulatory sequence to allow expression of an RNA molecule which 
is anti-sense to a nucleotide sequence as shown in SEQ. ID NO:l. 

15. A transformed cell including a recombinant expression vector as claimed in claim 12. 

16. A transformed cell including a recombinant expression vector as claimed in claim 13. 

17. A method for preparing an isolated protein having an amino acid sequence as shown 
in SEQ. ID NO:2 said method comprising culturing a transformed cell including a 
recombinant expression vector as claimed in claim 13 in a suitable medium until the protein 
is formed and isolating said protein. 

1 8. The polypeptide expressed by the expression vector of claiml3. 

19. pharmaceutical composition comprising the antisense molecule of claim 3 and a 
pharmaceutical^ acceptable carrier. 

20. A probe comprising an oligonucleotide of the nucleic acid as shown in SEQ. ID NO: 1 
capable of specifically hybridizing with a gene which encodes a protein having an amino acid 
sequence as shown in SEQ. ID NO:2 or allelic and species variants thereof. 

21. An isolated polypeptide having the amino acid sequence as shown in SEQ. ID NO:2. 

22. A purified polyclonal antibody specific for the amino acid sequence as shown in SEQ. 
IDNO:2. 

23. The purified polyclonal antibody of claim 22 wherein said antibody is specific for an 
extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID 
NO:2. 

24. The purified polyclonal antibody of claim 22 wherein the antibody is labeled. 

25. A monoclonal antibody specific for the amino acid sequence as shown in SEQ. ID 
NO:2. 

26. The monoclonal antibody of claim 25 wherein said antibody is specific for an 
extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID 
NO:2. 

27. The monoclonal antibody of claim 25 wherein the antibody is labeled. 

28. The method for determining the presence of a protein having an amino acid sequence 
as shown in SEQ. ID NO:2 in a biological sample, the method comprising the steps of: 

a) incubating the sample with a monoclonal antibody or purified polyclonal antibody 
which specifically binds to an epitope of said protein under conditions sufficient for the 
formation of an immune complex; and, 
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b) detennining the presence of said immune complex. 

29. The method of claim 28 wherein the monoclonal or purified polyclonal antibody is 
labeled. 

30. A method of purifying a protein having the amino acid sequence as shown in SEQ. ID 
5 NO:2, the method including an immunoaffinity chromatography process wherein a 

monoclonal antibody or a purified polyclonal antibody specific to an epitope of said protein is 
immobilized on the chromatography resin. 

31. A method of screening a molecule which ligates to the protein having the amino acid 
sequence as shown in SEQ. ID NO:2 comprising a signal transduction assay. 

10 32. The method of claim 31, wherein the protein is a G protein coupled receptor, the 
method comprising the following steps: 

a) co-transfecting into a suitable cell of a plasmid including a reporter gene and an 
expression plasmid coding for said protein; 

b) expressing said protein; 

15 c) treating said cell with serum starvation to reduce mitogenic activity; 

d) applying said molecule which ligates to said protein in a serum free medium; and, 

e) measuring the activity of the reporter. 

33. A transgenic animal expressing a first transgene coding for a protein having an amino 
acid sequence as shown in SEQ. ID NO:2. 
20 34. The transgenic animal of claim 33 wherein said first transgene comprises a 
polynucleotide having a nucleotide sequence as shown in SEQ. ID NO: 1 . 

35. A transgenic animal as claimed in claim 33 further including a second transgene 
coding for an inducible promoter for said first transgene. 

36, A transgenic animal as claimed in claim 33 further including a second transgene 
25 coding for a tissue specific regulatory element for regulating the expression of said first 

transgene. 
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AMENDED CLAIMS 

[received by the International Bureau on 6 November 1998 (06.11.98); 
original claims 1-36 replaced by amended claims 1-22 (3 pages)]. 


5 1. An isolated nucleic acid molecule wherein the molecule is selected from the group 
consisting of : 

a) a molecule having a nucleic acid sequence as shown in SEQ. ID. NO: 1 ; and 

b) hybridizing nucleic acid molecules that hybridize to a molecule having a nucleic 
acid sequence as shown in SEQ. ID NO:l or to complementary strands thereof, said 

1 0 hybridizing nucleic acid molecules having at least 40% homology with a molecule having a 
nucleic acid sequence as shown in SEQ. ID NO:l . 

2. The molecule of claim 1 wherein said hybridizing nucleic acid molecule hybridizes to 
SEQ. ID NO:l under stringent conditions. 

15 

3. The molecule of claim 1 wherein said hybridizing nucleic acid molecule has at least 
85% homology with a molecule having a nucleic acid sequence as shown in SEQ. ID NO:l. 

4. The molecule of claim 1 wherein said hybridizing nucleic acid molecule has at least 
20 90% homology with a molecule having a nucleic acid sequence as shown in SEQ. ID NO:l. 

5. The molecule of claim 1 wherein said hybridizing nucleic acid molecule has at least 
95% sequence identity with a molecule having a nucleic acid sequence as shown in SEQ. ID 
NO:l. 

25 

6. A DNA construct comprising the following operably linked elements: 

a) a transcriptional promoter, 

b) a DNA sequence including the nucleotide sequence as claimed in claim 2; and, 

c) a transcriptional terminator. 

30 

7. A recombinant expression vector suitable for transformation of a host cell comprising 
a nucleic acid as claimed in claim 2 and a regulatory sequence operatively linked to said 
nucleic acid. 
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8. A transformed cell including a recombinant expression vector as claimed in claim 7. 

9. A method for preparing an isolated amino acid sequence as claimed in claim 2, said 
method comprising culturing a transformed cell as claimed in claim 8 in a suitable medium 

5 until the protein is formed and isolating said protein. 

10. The polypeptide expressed by the rccombinant.expression vector of claim 7. 

11. A probe comprising an oligonucleotide of the nucleic acid as claimed in claim 2 

1 0 capable of specifically hybridizing with a gene which encodes a protein having an amino acid 
sequence as shown in SEQ. ID, NO: 2 or allelic and species variants thereof. 

12. A purified polyclonal antibody specific for the amino acid sequence as shown in SEQ. 
ID. NO: 2 or allelic variants thereof. 

15 

13. The purified polyclonal antibody of claim 12 wherein the antibody is specific for an 
extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic variants thereof. 

20 1 4. The purified polyclonal antibody of claim 12 wherein the antibody is labeled. 

15. A monoclonal antibody specific for the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic variants thereof. 

25 16. The monoclonal antibody of claim 15 wherein said antibody is specific for an 

extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic varicnts thereof. 

17. The monoclonal antibody of claim 15 wherein the antibody is labeled. 

30 
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8. A transformed cell including a recombinant expression vector as claimed in claim 7. 

9. A method for preparing an isolated amino acid sequence as claimed in claim 2, said 
method comprising culturing a transformed cell as claimed in claim 8 in a suitable medium 

5 until the protein is formed and isolating said protein. 

1 0. The polypeptide expressed by the rccombinant.expression vector of claim 7 . 

11. A probe comprising an oligonucleotide of the nucleic acid as claimed in claim 2 

1 0 capable of specifically hybridizing with a gene which encodes a protein having an amino acid 
sequence as shown in SEQ. ID. NO: 2 or allelic and species variants thereof 

12. A purified polyclonal antibody specific for the amino acid sequence as shown in SEQ. 
ID. NO: 2 or allelic variants thereof. 

15 

13. The purified polyclonal antibody of claim 12 wherein the antibody is specific for an 
extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic variants thereof. 

20 1 4, The purified polyclonal antibody of claim 12 wherein the antibody is labeled. 

15. A monoclonal antibody specific for the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic variants thereof. 

25 1 6. The monoclonal antibody of claim 15 wherein said antibody is specific for an 

extracellular epitope of the protein having the amino acid sequence as shown in SEQ. ID. 
NO: 2 or allelic varients thereof. 

17. The monoclonal antibody of claim 15 wherein the antibody is labeled. 

30 
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18. The method for determining the presence of a protein having an amino acid sequence 
as shown in SEQ. ID. NO: 2 or allelic variants thereof in a biological sample, the method 
comprising the steps of: 

a) incubating the sample with amonoclonal antibody or purified polyclonal antibody 
5 which specifically binds to an epitope of said protein under conditions sufficient for the 

formation of an immune complex; and, 

b) determining the presence of said immune complex. 

1 9 . The method of claim 1 8 wherein the monoclonal or purified polyclonal antibody is 
10 labeled. 

20. A method of purifying a protein having the amino acid sequence as shown in SEQ. 
ID. NO: 2 or allelic varients thereof the method including an iromunaaffinity 
chromatography process wherein a mono clonal antibody or a purified polyclonal antibody 

1 5 specific to an epitope of said protein is immobilized on the chromatography resin. 

21 . A method of screening a molecule which ligates to the protein having the amino acid 
sequence as shown in SEQ. ID. NO: 2 or allelic varients or fragments thereof comprising a 
signal transduction assay. 

20 

22. The method of claim 21, wherein the protein is a G protein coupled receptor, the 
method comprising the following steps: 

a) co-transfecting into a suitable cell of a plasmid including a reporter gene and an 
expression plasmid coding for said protein; 
25 b) expressing said protein; 

c) treating said cell with serum starvation to reduce mitogenic activity; 

d) applying said molecule which ligates to said protein in a serum free medium; and 

e) measuring the activity of the reporter. 

30 


-47- 


AMENDED SHEET (ARTICLE 19) 


WO 98/53062 


PCT/CA98/00487 


Figure 1 

hedff-6 cDNA and predicted amino acid sequence. The cloning sites and polv(A) tail 

have been excluded from this figure. 

Figure 1A 
SEQ. ID NO:l 

CGCTCCCGCCGCAGTCGCCGGGCCATGGGCCTCGAGCCCGCCCCGAACCCCCGCGAGCCC 
GCGAGGGCGGCGTCAGCGGCCCGGTACCCGGAGCTCGGGCGGGGCTTGGGGGCGCTCGGG 
! + + + ^ + + 60 

Figure IB 
SEQ. ID NO:2 

MVIMGQCYYN 
GCCTTGTCTGCGGCGTGACTGGAGGCCCAGATGGTCATCATGGGCCAGTGCTACTACAAC 
CGGAACAGACGCCGCACTGACCTCCGGGTCTACCAGTAGTACCCGGTCACGATGATGTTG 
61 + + + + + + 120 


ETIGFFYNNSGKELSSHWRP 
GAGACCATCGGTTTCTTCTATAACAACAGTGGCAAAGAGCTCAGCTCCCACTGGCGGCCC 
CTCTGGTAGCCAAAGAAGATATTGTTGTCACCGTTTCTCGAGTCGAGGGTGACCGCCGGG 
121 + + + + + + 180 


KDVVVVALGLTVSVLVLLTN 
AAGGATGTGGTCGTGGTGGCACTGGGGCTGACCGTCAGCGTGCTGGTGCTGCTGACCAAT 
TTCCTACACCAGCACCACCGTGACCCCGACTGGCAGTCGCACGACCACGACGACTGGTTA 
181 + + + + + 240 


LLVIAAIASNRRFHQPIYYL 
CTGCTGGTCATAGCAGCCATCGCCTCCAACCGCCGCTTCCACCAGCCCATCTACTACCTG 
GACGACCAGTATCGTCGGTAGCGGAGGTTGGCGGCGAAGGTGGTCGGGTAGATGATGGAC 
+ + + + + + 300 


LGNLAAADLFAGVAYLFLMF 

CTCGGCAATCTGGCCGCGGCTGACCTCTTCGCGGGCGTGGCCTACCTCTTCCTCATGTTC 

GAGCCGTTAGACCGGCGCCGACTGGAGAAGCGCCCGCACCGGATGGAGAAGGAGTACAAG 
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301 + + + + + + 360 

HTGPRTARLSLEGWFLRQGL 

CACACTGGTCCCCGCACAGCCCGACTTTCACTTGAGGGCTGGTTCCTGCGGCAGGGCTTG 

GTGTGACCAGGGGCGTGTCGGGCTGAAAGTGAACTCCCGACCAAGGACGCCGTCCCGAAC 
361 + + + + + + 42 0 

LDTSLTASVATLLAIAVERH 
CTGGACACAAGCCTCACTGCGTCGGTGGCCACACTGCTGGCCATCGCCGTGGAACGGCAC 
GACCTGTGTTCGGAGTGACGCAGCCACCGGTGTGACGACCGGTAGCGGCACCTTGCCGTG 
421 + + + + + + 480 

RSVMAVQLHSRLPRGRVVML 
CGCAGTGTGATGGCCGTACAGTTGCACAGCCGCCTGCCCCGTGGCCGCGTGGTCATGCTC 
GCGTCACACTACCGGCATGTCAACGTGTCGGCGGACGGGGCACCGGCGCACCAGTACGAG 
481 + + + + + + 540 

IVGVWVAALGLGLLPAHSWH 
ATTGTGGGCGTGTGGGTGGCTGCCCTGGGCCTGGGGCTGTTGCCTGCCCACTCCTGGCAC 
TAACACCCGCACACCCACCGACGGGACCCGGACCCCGACAACGGACGGGTGAGGACCGTG 
541 + + + + + + 600 

CLCALDRCSRMAPLLSRSYL 
TGCCTCTGTGCCCTGGACCGCTGCTCACGCATGGCACCCCTGCTCAGCCGCTCCTATTTG 
ACGGAGACACGGGACCTGGCGACGAGTGCGTACCGTGGGGACGAGTCGGCGAGGATAAAC 
601 + + + + + + 660 

AVWALS S L LV FLLMVAV Y TR 
GCCGTCTGGGCTCTGTCGAGCCTGCTTGTCTTCCTGCTCATGGTGGCTGTGTACACCCGC 
CGGCAGACCCGAGACAGCTCGGACGAACAGAAGGACGAGTACCACCGACACATGTGGGCG 
661 + + + + + + 720 
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I FLYVRRRVQRMAEHVSCHP 

ATTTTTTTATACGTGCGGCGGCGAGTGCAGCGCATGGCAGAGCATGTCAGCTGCCACCCC 

TAAAAAAATATGCACGCCGCCGCTCACGTCGCGTACCGTCTCGTACAGTCGACGGTGGGG 

72i + + + + + + 780 

RYRETTLSLVKTVVI I LGAF 

CGCTACCGAGAGACCACGCTCAGCCTGGTCAAGACTGTTGTCATCATCCTGGGGGCGTTC 

GCGATGGCTCTCTGGTGCGAGTCGGACCAGTTCTGACAACAGTAGTAGGACCCCCGCAAG 

781 + + + + + + 840 


VVCWTPGQVVLLLDGLGCES 
GTGGTCTGCTGGACACCAGGCCAGGTGGTACTGCTCCTGGATGGTTTAGGCTGTGAGTCC 
CACCAGACGACCTGTGGTCCGGTCCACCATGACGAGGACCTACCAAATCCGACACTCAGG 
841 + +— + + + + 900 


CNVLAVEKYFLLLAEAN SLV 
TGCAATGTCCTGGCTGTAGAAAAGTACTTCCTACTGTTGGCCGAGGCCAACTCACTGGTC 
ACGTTACAGGACCGACATCTTTTCATGAAGGATGACAACCGGCTCCGGTTGAGTGACCAG 
901 + + + + + + 960 


NAAVYSCRDAEMRRTFRRLL 
AATGCTGCTGTGTACTCTTGCCGAGATGCTGAGATGCGCCGCACCTTCCGCCGCCTTCTC 
TTACGACGACACATGAGAACGGCTCTACGACTCTACGCGGCGTGGAAGGCGGCGGAAGAG 
961 + + + + + + 

1020 


CCACLRQPTRESVHYTSSAQ 

TGCTGCGCGTGCCTCCGCCAGCCCACCCGCGAGTCTGTCCACTATACATCCTCTGCCCAG 

ACGACGCGCACGGAGGCGGTCGGGTGGGCGCTCAGACAGGTGATATGTAGGAGACGGGTC 

1021 + + + + + + 

1080 


GGASTRIMLPENGHPLMDST 
GGAGGTGCCAGCACTCGCATCATGCTTCCCGAGAACGGCCACCCACTGATGGACTCCACC 
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CCTCCACGGTCGTGAGCGTAGTACGAAGGGCTCTTGCCGGTGGGTGACTACCTGAGGTGG 

1081 + + + + + + 

1140 


L * 

CTTTAGCTACCTTGAACTTCAGCGGTACGCGGCAAGCAACAAATCCACAGCCCCTGATGA 
GAAATCGATGGAACTTGAAGTCGCCATGCGCCGTTCGTTGTTTAGGTGTCGGGGACTACT 


1200 

CTTGTGGGTGCTCCTGGCTCAACCCAACCAACAGGACTGACTGACCGGCAGGACAAGGTC 
GAACACCCACGAGGACCGAGTTGGGTTGGTTGTCCTGACTGACTGGCCGTCCTGTTCCAG 

1201 + + + + 4 + 

1260 

TGGCATGGCACAGCACCACTGCCAGGCCTCCCCAGGCACACCACTCTGCCCAGGGAATGG 
ACCGTACCGTGTCGTGGTGACGGTCCGGAGGGGTCCGTGTGGTGAGACGGGTCCCTTACC 

1261 + + + + + + 

1320 

GGGCTTTGGGTCATCTCCCACTGCCTGGGGGAGTCAGATGGGGTGCAGGAATCTGGCTCT 
CCCGAAACCCAGTAGAGGGTGACGGACCCCCTCAGTCTACCCCACGTCCTTAGACCGAGA 

1321 + + + + + + 

1380 

TCAGCCATCCCAGGTTTAGGGGGTTTGTAACAGACATTATTCTGTTTTCACTGCGTATCC 
AGTCGGTAGGGTCCAAATCCCCCAAACATTGTCTGTAATAAGACAAAAGTGACGCATAGG 

1381 + + + + + + 

1440 

TTGGTAAGCCCTGTGGACTGGTTCCTGCTGTGTGATGCTGAGGGTTTTAAGGTGGGGAGA 
AACCATTCGGGACACCTGACCAAGGACGACACACTACGACTCCCAAAATTCCACCCCTCT 
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1441 + + + + + + 

1500 


GATAAGGGCTCTCTCGGGCCATGCTACCCGGTATGACTGGGTAATGAGGACAGACTGTGG 
CTATTCCCGAGAGAGCCCGGTACGATGGGCCATACTGACCCATTACTCCTGTCTGACACC 

1501 + + + + + + 

1560 


ACACCCCATYTACCTGAGTCTGATTCTTTAGCAGCAGAGACTGAGGGGTGCAGAGTGTGA 
TGTGGGGTARATGGACTCAGACTAAGAAATCGTCGTCTCTGACTCCCCACGTCTCACACT 

1561 + + + + + + 

1620 


GCTGGGAAAGGTTTGTGGCTCCTTGCAGCCTCCAGGGACTGGCCTGTCCCCGATAGAATT 
CGACCCTTTCCAAACACCGAGGAACGTCGGAGGTCCCTGACCGGACAGGGGCTATCTTAA 

1621 -+ + + + + + 

1680 


GAAGCAGTCCACGGGGAGGGGATGATACAAGGAGTAAACCTTTCTTTACACTCTGAGGTC 
CTTCGTCAGGTGCCCCTCCCCTACTATGTTCCTCATTTGGAAAGAAATGTGAGACTCCAG 

1681 + + + + + + 

1740 


TCCAAAACATTTGTTGTTATC 
AGGTTTTGTAAACAACAATAG 
1741 + +- 1761 
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Figure 2 

Nucleotide sequence of human edq-6 cDNA insert. Sequence includes the EcoRI 
(position 81) and NotI (position 1882) cloning sites and the 34 bp polvfA^ tail 

SEQ. ID NO: 3 

TTACGAATTAATACGATCACTATAGGGAGACCAAGCTTGGTACCGAGCTCGGATCCACTA 
! + + + + + — + 6 o 

GTAACGGCCGCCAGTGTGGGGAATTCCGCTCCCGCCGCAGTCGCCGGGCCATGGGCCTCG 
61 + + + + + + 12Q 

AGCCCGCCCCGAACCCCCGCGAGCCCGCCTTGTCTGCGGCGTGACTGGAGGCCCAGATGG 
12 i + + + + + + 180 

TCATCATGGGCCAGTGCTACTACAACGAGACCATCGGTTTCTTCTATAACAACAGTGGCA 
181 +- + + + + + 240 

AAGAGCTCAGCTCCCACTGGCGGCCCAAGGATGTGGTCGTGGTGGCACTGGGGCTGACCG 
241 + + +, + + + 300 

TCAGCGTGCTGGTGCTGCTGACCAATCTGCTGGTCATAGCAGCCATCGCCTCCAACCGCC 
301 + + + + + + 360 

GCTTCCACCAGCCCATCTACTACCTGCTCGGCAATCTGGCCGCGGCTGACCTCTTCGCGG 
361 + + + + + + 420 

GCGTGGCCTACCTCTTCCTCATGTTCCACACTGGTCCCCGCACAGCCCGACTTTCACTTG 
421 + + + + + + 480 

AGGGCTGGTTCCTGCGGCAGGGCTTGCTGGACACAAGCCTCACTGCGTCGGTGGCCACAC 
481 + + + + + + 540 

TGCTGGCCATCGCCGTGGAACGGCACCGCAGTGTGATGGCCGTACAGTTGCACAGCCGCC 
541 + + + + + + goo 

TGCCCCGTGGCCGCGTGGTCATGCTCATTGTGGGCGTGTGGGTGGCTGCCCTGGGCCTGG 
601 + + + + + + 660 

GGCTGTTGCCTGCCCACTCCTGGCACTGCCTCTGTGCCCTGGACCGCTGCTCACGCATGG 
661 + + +- + + + 720 

CACCCCTGCTCAGCCGCTCCTATTTGGCCGTCTGGGCTCTGTCGAGCCTGCTTGTCTTCC 
721 + + + + + + 780 

TGCTCATGGTGGCTGTGTACACCCGCATTTTTTTATACGTGCGGCGGCGAGTGCAGCGCA 
73! + + + + 4 + 840 

TGGCAGAGCATGTCAGCTGCCACCCCCGCTACCGAGAGACCACGCTCAGCCTGGTCAAGA 
841 + + + + + + 900 

CTGTTGTCATCATCCTGGGGGCGTTCGTGGTCTGCTGGACACCAGGCCAGGTGGTACTGC 
901 + + + + + + 960 

TCCTGGATGGTTTAGGCTGTGAGTCCTGCAATGTCCTGGCTGTAGAAAAGTACTTCCTAC 
961 + + + +— + + 1020 

TGTTGGCCGAGGCCAACTCACTGGTCAATGCTGCTGTGTACTCTTGCCGAGATGCTGAGA 
1021 + + + + + + 1080 

TGCGCCGCACCTTCCGCCGCCTTCTCTGCTGCGCGTGCCTCCGCCAGCCCACCCGCGAGT 
1081 + + + + + + 1140 
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CTGTCCACTATACATCCTCTGCCCAGGGAGGTGCCAGCACTCGCATCATGCTTCCCGAGA 
1141 + + + + + + 1200 

ACGGCCACCCACTGATGGACTCCACCCTTTAGCTACCTTGAACTTCAGCGGTACGCGGCA 
1201 + + + + + + 1260 

AGCAACAAATCCACAGCCCCTGATGACTTGTGGGTGCTCCTGGCTCAACCCAACCAACAG 
1261 + + + + + + 1320 

GACTGACTGACCGGCAGGACAAGGTCTGGCATGGCACAGCACCACTGCCAGGCCTCCCCA 
1321 + + + + + + 1380 

GGCACACCACTCTGCCCAGGGAATGGGGGCTTTGGGTCATCTCCCACTGCCTGGGGGAGT 
1381 + + + + + — + 1440 

CAGATGGGGTGCAGGAATCTGGCTCTTCAGCCATCCCAGGTTTAGGGGGTTTGTAACAGA 
1441 + + + + + + 1500 

CATTATTCTGTTTTCACTGCGTATCCTTGGTAAGCCCTGTGGACTGGTTCCTGCTGTGTG 
1501 + + + + + + 1560 

ATGCTGAGGGTTTTAAGGTGGGGAGAGATAAGGGCTCTCTCGGGCCATGCTACCCGGTAT 
1561 + + + + + + 1620 

GACTGGGTAATGAGGACAGACTGTGGACACCCCATYTACCTGAGTCTGATTCTTTAGCAG 
1621 + + + + + + 1680 

CAGAGACTGAGGGGTGCAGAGTGTGAGCTGGGAAAGGTTTGTGGCTCCTTGCAGCCTCCA 
1681 + + + + + + 1740 

GGGACTGGCCTGTCCCCGATAGAATTGAAGCAGTCCACGGGGAGGGGATGATACAAGGAG 
1741 + + + + + + 1800 

TAAACCTTTCTTTACACTCTGAGGTCTCC7VAAACATTTGTTGTTATCAAAAAAAAAAAAA 
1801 +■ + + + + + I860 

AAAAAAAAAAAAAAAAAAAAAGCGGCCGC 
1861 + + 1889 
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